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Abstract 

A measurement of the frjet production cross section is presented for events containing a Z 
boson produced in pp collisions at yfs = 1.96 TeV, using data corresponding to an integrated 
luminosity of 2 fb _1 collected by the CDF II detector at the Tevatron. Z bosons are selected in 
the electron and muon decay modes. Jets are considered with transverse energy Et > 20 GeV and 
pseudorapidity \n\ < 1.5 and are identified as ftjets using a secondary vertex algorithm. The ratio 
of the integrated Z + b jet cross section to the inclusive Z production cross section is measured 
to be 3.32 ± 0.53(stat.) ± 0.42(syst.) x 1CT 3 . This ratio is also measured differentially in jet Et, 
jet rj, Z-boson transverse momentum, number of jets, and number of b jets. The predictions from 
leading order Monte Carlo generators and next-to-leading-order QCD calculations are found to be 
consistent with the measurements within experimental and theoretical uncertainties. 
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FIG. 1: Leading order Feynman diagrams for gb — > and — > production. 
I. INTRODUCTION 

The associated production of Z bosons and one or more frjets provides an important test 
of quantum-chromodynamics (QCD) calculations, for which the theoretical predictions for 
this process vary significantly [1-3]. The understanding of this process and its description 
by current theoretical calculations is important since it is the largest background, e.g., to 
the search for the standard model Higgs boson in the ZH — > Zbb decay mode [4] and to 
searches for the supersymmetric partners of b quarks [5, 6]. The process is also sensitive to 
the b quark density in the proton. A precise knowledge of the b quark density is necessary 
to accurately predict processes that strongly depend on it such as electroweak production 
of single top quarks [7] or the production of Higgs bosons within certain supersymmetric 
models [8, 9]. 

The Feynman diagrams of the contributing leading order processes gb — > Zb and qq — > Zbb 
are shown in Fig. 1. In the first two diagrams a b quark from the proton undergoes a 
hard scatter and a b quark typically remains close to the parent proton and may not be 
detected. In the third diagram the bb quark pair can be produced close to each other and 
may sometimes be reconstructed in the same jet (referred to as a "66 jet"). According to 
QCD calculations the latter diagram is predicted to account for approximately 50% of b jet 
production in association with a Z boson at the Tevatron [1]. 

Previously the integrated cross section for Z + 6 jet production has been measured with 
an uncertainty of 39% by the CDF collaboration [10]. The DO collaboration also measured 
this process assuming the ratio of the Z + 6 jet to Z + cjet cross section from next-to- 
leading order (NLO) QCD calculations [11]. The cross section of Z+jets production has 
also been measured recently by both collaborations and found to agree well with QCD 
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calculations [12, 13]. A preliminary measurement has been made of the related W + b jet 
pro cess [14]. 

In this article we present an update to the integrated Z + bjet cross section measurement 
with a substantially reduced uncertainty and for the first time differential cross section 
measurements. The measurement is made by selecting pairs of electrons or muons (dileptons) 
with an invariant mass consistent with the mass of the Z boson, Mz, and jets containing 
a displaced secondary vertex consistent with the decay of a bottom hadron. Contributions 
from the decay of known heavy particles (such as Z or top quarks) to b hadrons are not 
included in our definition of the cross section and are subtracted from the data. 

The light and charm jets (i.e., jets that do not contain a b hadron) remaining after 
this selection are discriminated from frjets using the invariant mass of all charged particles 
associated to the secondary vertex, exploiting the large mass of the b quark compared to 
the other partons. Throughout this article we use Z to denote any dilepton events due to 
Z or 7* production with an invariant mass 76 < M tt < 106 GeV/c 2 , albeit the contribution 
of virtual photons is predicted to be below 1% of the Z production rate [15]. 

We use data collected by the CDF II detector at the Tevatron pp collider between February 
2002 and May 2007, corresponding to an integrated luminosity of 2.0 fb _1 . 

II. THE CDF II DETECTOR 

The CDF II detector is described in detail elsewhere [16] and consists of a precision 
tracking system, electromagnetic and hadronic calorimeters and muon spectrometers. The 
tracking detector is coaxial with the beam-pipe and consists of silicon strip detectors [17] 
surrounded by a wire drift chamber (COT) [18] inside a 1.4 T magnetic field provided by 
a solenoid. The solenoid is surrounded by electromagnetic [19, 20] and hadronic calorime- 
ters [21] that use lead and stainless steel as absorber materials, respectively, and scintilla- 
tors as active material. Inside the electromagnetic calorimeter a proportional strip and wire 
chamber is embedded at about 6 radiation lengths, providing an accurate position measure- 
ment [22]. The muon detectors [23] surround the calorimeters and consist of wire chambers 
and scintillators. Gas Cerenkov counters, located close to the beampipe, are used to measure 
a fraction of the inelastic event rate and thereby the collider luminosity [24]. 

A cylindrical coordinate system is used in which the z axis is along the proton beam 
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direction and 9 is the polar angle. The pseudorapidity is defined as r] = — lntan(0/2), while 
the transverse momentum is given by pt = p sin 9 and the transverse energy by Et = E sin 9. 
Missing transverse energy, is defined as the magnitude of — EjE^raj, where E % T is the 
transverse energy deposited in the i th calorimeter tower and hi is a unit vector pointing from 
the beamline to the ith tower in the azimuthal plane. 

III. MEASUREMENT PROCEDURE 

In this article we present a measurement of the ratio of the cross section for Z + b jet pro- 
duction to the inclusive Z production cross section. Measuring the ratio has the advantage 
that several uncertainties, e.g., on the integrated luminosity and on the lepton identification, 
largely cancel. We present both per jet and per event cross section ratios. The per jet cross 
section ratio is proportional to the number of b jets, while the per event cross section ratio 
is proportional to the number of events with one or more 6 jets. The per jet cross section 
selection efficiency is independent of the number of frjets in the event as it is proportional 
directly to the efficiency of identifying a b jet, e&j et , and thus has a smaller overall error. For 
certain measurements, such as the cross section ratio as a function of the number of jets, it is 
preferable to use an event based definition. In this case the event selection efficiency depends 
on the number of 6 jets in the event: for events with one b jet it is also simply proportional 
to e&j C t while for events with two b jets the efficiency for, e.g., identifying at least one b jet is 
proportional to e fejct • (2 - e bjet ). 

The cross section ratios are determined by 



where N(Z) is the number of events in the data with a Z boson and e(Z) is the efficiency 
x acceptance for the Z boson selection within the Mu range of this analysis. For the per 
jet cross sections, a ]Ct (Z + b jet), the number of estimated 6 jets in data for events with a Z 
boson is N ]Ct (Z + b]et). For the per event cross sections, a cvt (Z + N b j et ), the number of data 
events with a Z boson is N cvt (Z + N b j Ct ), with each event having the number of b jets equal to 
iV fe jet. All quantities are quoted after background subtraction. The quantities e iel {Z + 6 jet) 



a jet(Z + b jet) Ni ct (Z + b jet) /N(Z) 

a(Z) ~ ei ct (Z + 6jet)/e(Z) 
a cvt (Z + N b j Ct ) N cvt (Z + N bict )/N(Z) 
a(Z) e^(Z + N biet )/e(Z) ' 



(2) 



(1) 
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and e cvt (Z + A*6j Ct ) are the corresponding efficiency x acceptances for the Z + b jet and the 
Z + iVftjet selections, respectively. 

In the following, the selection and the methods for determining the efficiencies and the 
number of b jets are described. The Monte Carlo simulation is tuned to reproduce the trigger, 
lepton, and b jet efficiencies as measured in the data, and is used to correct the data for all 
detector effects such as acceptance losses, efficiencies and resolutions. 

IV. MONTE CARLO SIMULATION 

We use pythia [2] and alpgen [3] as the main Monte Carlo generators. For the pythia 
generation the inclusive Drell-Yan process for Z production is used, and jets are generated 
via the parton shower. This process includes matrix-element inspired corrections to the 
parton shower to better describe the px distribution of the Z boson [25]. alpgen calculates 
the leading order (LO) matrix elements separately for each parton emission and then matches 
to a parton shower from pythia. Double-counting between the matrix-element calculations 
and the parton-showers is avoided by using the MLM matching procedure [26]. For both 
pythia and alpgen the CTEQ5L [27] structure function is used for the parton distribution 
functions and "Tune A" is used for the underlying event [28, 29]. For the modeling of c and 
frjets a combination of pythia and alpgen is used: the samples are averaged using equal 
portions of both since this gives the best description of the Et and r\ distribution of the 
frjets. For the description of light jets we use only the pythia sample which gives a good 
description of inclusive Z+jet production at low jet multiplicities that are relevant for this 
analysis. The decays of b hadrons are performed using evtgen [30]. The generated events 
are passed through the GEANT3-based [31] CDF detector simulation [32], and thereafter 
reconstructed and analyzed in the same way as the data. 

V. EVENT SELECTION 

Z boson candidates are identified in events with a dilepton pair which have an invariant 
mass Mu between 76 and 106 GeV/c 2 where t — e,//. 

Events in the electron channel are triggered online by either one central {\v\ < 1.1) 
electromagnetic calorimeter cluster with E T > 18 GeV and a track with p T > 9 GeV/c 
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associated to it, or by two electromagnetic clusters with E T > 18 GeV and \r]\ < 3.2, where 
no track association is required. These requirements are also made in the offline analysis. 
Furthermore, all central electrons are required to have E T > 10 GeV and a matched track 
with pt > 5 GeV/c, and all forward electrons are required to have Et > 18 GeV. For 
forward electrons (1.1 < \r)\ < 3.2) a track requirement is not imposed unless both electrons 
are forward, in which case they are required to have Et > 25 GeV and a matched track with 
Pt > 10 GeV/c. The electrons also have to pass certain quality criteria to verify that they 
are consistent with the electromagnetic shower characteristics expected for electrons [15]. 

Events in the muon channel are triggered on at least one muon candidate that has a signal 
in one of the muon chambers with < 1.0 and Pt > 18 GeV/c. In the offline analysis the 
second muon candidate is not required to have a signal in the muon chambers but it must 
have hits in the COT which reduces the acceptance to < 1.5. All muons are required to 
have calorimeter energy deposits consistent with those expected from a minimum ionizing 
particle [15]. 

All leptons are required to be isolated from other particles in the event by a distance 
of AR = v/(A0) 2 + (Ar]) 2 > 0.4, and at least one of the two muons (electrons) must have 
Pt > 18 GeV/c (Et > 18 GeV) while the second one is only required to have pt > 10 GeV/c 
(Et > 10 GeV). However, for dielectron events where the two electrons are in the forward 
calorimeter, both are required to have E T > 18 GeV to match the trigger requirements. 

In order to reduce the background from particles that fake electrons or muons the two 
leptons in each event are required to have opposite charge. This cut is not applied in the 
electron channel if one or both electrons are forward, since the charge determination is not 
very precise in this region of the detector [33]. 

Using this selection we observe 193,749 Z — > e + e _ and 101,967 Z — > fi + fi~ candidates. 
The selection has e(Z) = 41% for Z -> e + e~ and e(Z) = 23% for Z -> events. 

Jets are selected using a cone based algorithm with a cone size of AR = 0.7 [34]. This 
choice of cone size has the advantage over smaller cone sizes in that the hadronization 
corrections are smaller. The jets are measured in the calorimeter and corrected to the 
hadron level [35], i.e., they are corrected for the CDF calorimeter response and multiple pp 
interactions. Note that the jets are not corrected for the underlying event (underlying event 
correction) or any changes in the energy contained within the jet cone due to fragmentation 
and any energy loss due to out-of-cone parton radiation (hadronization correction). In order 
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to compare to parton level calculations, these additional corrections are determined and 
applied to the theoretical calculation as described later. We observe 29,363 Z — > e + e _ and 
18,087 Z — > candidate events with at least one jet with E T > 20 GeV and \rj\ < 1.5. 

A b jet is defined at the hadron level as any jet that contains a b hadron within its cone. A 
secondary vertex algorithm is used to identify b jets based on tracks with p T > 0.5 GeV/ c that 
are displaced from the primary vertex, exploiting the relatively long lifetime of b hadrons, 
as described in detail in Ref. [36]. Jets with a reconstructed secondary vertex are denoted 
as "tagged" jets. The b tagging efficiency varies between 30% and 40% in the E T range 
relevant for this analysis, and has been measured using data with an uncertainty of 5.3%. 
The algorithm also tags about 8% of c jets and 0.5% of light jets as determined with Monte 
Carlo simulation. 

A sign is assigned depending on whether the secondary vertex is in the same hemisphere as 
the jet (positive tag) or in the opposite hemisphere (negative tag). For b jets the direction 
of the vertex tag is aligned with the jet direction generally yielding a positive tag, while 
for misreconstructed secondary vertices from light jets the two directions are uncorrected, 
yielding similar amounts of negative and positive tags. Since the jets with negative tags are 
used in the fit to determine the fraction of b jets (see section VII), it is necessary to verify 
that the ratio of negatively to positively tagged light jets in the Monte Carlo reproduces 
that in the data. The ratio has been measured in inclusive jet production as 0.65 ± 0.07, in 
good agreement with the simulation value of 0.62. 

Events are rejected if $ T > 25 GeV and the sum of the transverse energies of all jets, lep- 
tons and $ T is greater than 150 GeV. These cuts reduce the background from tt production 
by a factor 10, while retaining 99% of the signal. 

The efficiency of this selection is e set (Z + 6 jet) = 8.7%. In the data we observe 648 
positively tagged jets and 151 negatively tagged jets. There are nine events that contain 
two tagged jets. For these events all tags are found to be positive. 

The sample of tagged events contains a small amount of background from known processes 
which have a true b jet and a larger background contribution from events where a c jet or a 
light jet has produced a secondary vertex tag. These backgrounds are discussed in the next 
two sections. 
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VI. BACKGROUND CONTRIBUTIONS 



The most important backgrounds that contain a true b jet arise from ZZ and tt pro- 
duction and from processes where one or two jets are misidentified as a lepton. This latter 
contribution arises mainly from W+jets events where one jet is misidentified and multi-jet 
production where two jets are misidentified. Background from non-6 jets is discussed in 
section VII. 

The ZZ and tt backgrounds are determined using pythia Monte Carlo simulation. The 
ZZ Monte Carlo simulation is normalized to the NLO QCD cross section calculation of 
37.2 fb [37], which is the part of the cross section where both Zs are in the mass range 
76 < M z < 106 GeV/c 2 and where one Z decays to any of the charged leptons and the 
other to bb. The Monte Carlo simulation also produces events for 7*/Z production outside 
this mass range and for all standard model decays. The tt cross section is taken from NLO 
QCD as 6.7 pb [38]. We estimate an uncertainty on these backgrounds of 20%, which takes 
into account the uncertainty in the theoretical prediction for the production cross section 
and in the experimental acceptance for our analysis. Backgrounds from Z — > t + t~ and WW 
production were also studied. Both were found to be small, with Z — > t + t~ contributing 
0.3 and WW contributing < 0.01 to the number of tagged jets. 

The backgrounds due to jets being misidentified as leptons are determined using the 
data. For the dielectron channel a "fake rate" method is used where the fraction of jets 
misidentified as electrons is measured in inclusive jet samples and then applied to the jets 
in a sample of data events with one reconstructed electron. This technique is described in 
more detail in Refs. [10] and [39]. The uncertainty on this background is estimated at 50%, 
using the agreement between data and Monte Carlo simulation in the sidebands of the Z 
mass distribution. For the dimuon channel we use events in which both muons have the 
same electric charge since the chance of faking a muon is assumed to be charge independent. 
The statistical error on this number of events is used as the uncertainty. The resulting 
background estimates for the number of tagged jets (including both positive and negative 
tags) are shown in Table I. 

The dilepton invariant mass is shown in Fig. 2 for events with at least one positively tagged 
jet for the data, the pythia signal Monte Carlo sample, and the background processes. The 
signal Drell-Yan Monte Carlo sample is normalized such that the expectation equals the 
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TABLE I: Estimated numbers of background tagged jets (positive and negative) for the e + e and 
the fj, + fi~ channels. The uncertainties include both statistical and systematic errors. 

Background source e + e~ fi + fi~ 
ZZ 6.5 ±1.3 4.3 ±0.9 

ti 1.3 ±0.3 1.4 ±0.3 

Z^t + t-/WW 0.2 ±0.1 0.1 ±0.1 
fake lepton 16.4 ± 8.2 5.0 ± 2.2 



number of data events in the range 76 < M u < 106 GeV/c 2 . The shape of expectation agrees 
well with the data distribution both in the peak, where the Drell-Yan signal dominates, and 
in the tails, where the background is significant. 

VII. DETERMINATION OF THE FRACTION OF b JETS 

After the selection described in section V the sample of tagged jets contains a significant 
fraction of light and charm jets. Since the Z + c jet cross section in the data is unknown and 
the simulation may not accurately describe the rate of light jets that are reconstructed with 
a secondary vertex, the fraction of 6jets in the data is determined using a likelihood fit of 
the invariant mass distribution of the tracks forming the secondary vertex Msvtx- Due to 
the different masses of the quarks this distribution enables a good discrimination between 
light, c, and b jets. It should be noted that these distributions are affected, particularly 
for low values, by the minimum track px requirement and the efficiency of the algorithm to 
correctly assign tracks to the secondary vertex. 

The fit is performed for M S vtx < 3.5 GeV/c 2 where the data have reasonable statistics. 
The Z±jets Monte Carlo simulation and the simulation for the background processes, apart 
from fake leptons, are used to make templates for the shape of the M S vtx distributions for 
light, c and b jets. The template for the shape of the background from fake leptons is taken 
from the data. The normalization of each background is fixed (see section VI), while the 
normalizations of the light, c, and b components of the Z±jets are free parameters of the fit. 

This fit is done simultaneously for positively and negatively tagged jets. We include the 
negatively tagged jets in the fit since this results in a reduced uncertainty on the number of 
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FIG. 2: Dilepton invariant mass for events with at least one positive secondary vertex tag. The 
data (points) are shown together with the Drell-Yan Monte Carlo (open histogram) and the sum 
of the background contributions (filled histogram). The range where the data are selected for this 
analysis is indicated by arrows. 

light and c jets, although the effect on the uncertainty for the number of b jets is marginal. 
The resulting fit is shown in Fig. 3. It is seen that for the positively tagged jets the fejets 
populate the higher M S vtx values due to the large b quark mass, allowing them to be 
discriminated from the light and charm background that is concentrated at low Msvtx- 
The negatively tagged jets are mainly populated by light jets, which are thus constrained by 
including this distribution in the fit. The fit yields the number of positively tagged b jets as 
Nb = 270 ± 43. The correlation coefficient between the number of b and c jets is —0.78 and 
between the number of frjets and light jets is +0.22. As a consistency check, when fitting 
only the positively tagged jets, the result of = 273 ± 44 is consistent with the default fit. 

This technique for estimating the number of fejets is used for the integrated and all 
differential cross section measurements except for the Z + 2b jets measurement for which the 
statistics are too low to use this fit procedure. Since the background for double-tagged events 
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FIG. 3: Invariant mass of tracks at the secondary vertex for (a) positively and (b) negatively tagged 
jets. Shown are the data (points) and the fitted contributions of light, c and b jets. Also shown is 
the background contribution from Z+light jets, Z + c jets, and from other processes with b jets. 
The fitted number of light jets (JVj), c jets (N c ), b jets (-/V&), and the number of background events 
(Nb g ) is also shown. 



from c and light flavor jets is predicted from Monte Carlo to be only 0.79 events, compared 
to 9 observed data events we simply subtract those components using the prediction. 
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VIII. SYSTEMATIC UNCERTAINTIES 



There are several sources of systematic uncertainty that are all evaluated separately for 
the integrated Z + b jet cross section and for each bin of the differential measurements. 
Table II lists each source of systematic uncertainty and its effect on the integrated cross 
section ratio. 

TABLE II: The systematic uncertainties on the measurement of the ratio a iCt (Z + bjet)/a(Z). The 
total systematic uncertainty is estimated by adding the individual uncertainties in quadrature. 



Source of Uncertainty 


Uncertainty (%) 


MC dependence 


8.0 


MC if ct dependence 


2.8 


track finding efficiency 


5.7 


b quark fragmentation 


0.8 


bb/b, cc/c jet fractions 


3.8 


light jet template 


1.7 


6-tagging efficiency 


5.3 


jet energy scale 


2.4 


misidentified lepton background 


1.9 


other backgrounds 


0.8 


total 


12.7 



The largest systematic uncertainties arise from the Monte Carlo modeling of the 6 jet Et 
distribution and the shape of the templates used for the extraction of the b jet fraction. 

The uncertainty on the E T dependence of frjets in Z + & jet production is estimated 
directly from the data by reweighting the shape of the simulated distribution maintaining 
consistency with the data at the la level. These variations are of a similar magnitude as 
the differences between the pythia and alpgen generators. The resulting uncertainty is 
8.0%. The same technique is used to estimate a systematic uncertainty due to the modeling 
of the jet ?] distribution of 2.8%. 

Systematic uncertainties on the shape of the M S vtx templates are estimated by varying 
the track finding efficiency by 3%, by changing the b quark fragmentation function, and 
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by varying the fraction of bb to frjets (and cc to cjets) between zero and three times their 
default values in the simulation. These result in cross section uncertainties of 5.7%, 0.8%, 
and 3.8%, respectively. For the light jet template the relative contribution of negative tags 
with respect to positive tags is varied by 25%, resulting in a cross section uncertainty of 1.7%. 
The 6-tagging efficiency uncertainty of 5.3% results in an uncertainty on the cross sections 
with one b jet of 5.3% and on those with two b jets of 10.6%. Additional uncertainties arise 
from the jet energy scale [35] (2.4%) and the backgrounds as described in section VI (2.0%). 

All these uncertainties apply to the ratio of the Z + 6 jet to the Z cross section. For 
the Z + 6 jet cross section itself, additional uncertainties apply due to the uncertainty on 
the integrated luminosity (5.8%) [40] and on the CDF measurement of the Z cross section 
(1.8%) [15]. 

While the per-jet cross section is independent of the Monte Carlo model used for the 
number of b jets in each event, the event based cross sections depend on the assumption on 
this number. We estimate a systematic uncertainty on the ratio of events with two 6jets to 
one b jet of 30% as determined from the measurement of the cross section ratio for one and 
two frjets presented in section IX. This results in an additional uncertainty of up to 4.7% 
on any event based cross section. 

For the measurement of a eyt (Z + 2b jets) /a(Z) an uncertainty of 100% on the c and light 
jet backgrounds is taken, resulting in an uncertainty of 12% on the cross section ratio. 

IX. RESULTS 

In this section the integrated and differential measurements for the Z + bjet cross section 
divided by the inclusive Z cross section are presented. Also shown are the integrated cross 
section and the integrated cross section divided by the Z+jet cross section. 

The measurements are compared to the leading-order QCD Monte Carlo generators 
pythia and alpgen, and to the next-to-leading order calculations as implemented in 
mcfm [1]. The QCD calculations are always performed in the same kinematic range as 
the data. 

The mcfm calculation is performed at order a 2 . The gb — > Zb is dependent on the b quark 
density, and is performed at next-to- leading order in a s . The other processes contributing 
at order a 2 are the final states Zbg and Zbb, which are calculated at leading order. The b 
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quarks are treated as massless throughout, except in the contribution qq — > Zbb where the 
quark mass is required in order to render the calculation finite. The NLO corrections are 
known to substantially increase the cross section for the gb — > Zb process and to decrease its 
dependence on renormalization and factorization scales. For the qq — > Zbb process no full 
NLO calculations are available for the case where only one b jet is observed. This leads to a 
substantial uncertainty on the cross section as discussed below. For the results presented here 
two predictions are compared: Q 2 = m| + p\ z and Q 2 = (^f=t Pt,i) / Nj e t — (prjet)- The 
same scales are used for the renormalization and factorization scale for the two predictions. 

ALPGEN is a tree-level generator where the partonic initial and final states are showered 
using PYTHIA. In the evaluation of the matrix elements, ALPGEN treats b quarks as massive, 
and therefore b quarks cannot be considered as parts of the partonic density of the proton. 
The inclusive Z + b final state emerges in ALPGEN as part of the full gg — > Zbb process after 
summing over the full phase-space of the b quark. The result is dominated by configurations 
where one of the initial-state gluons splits into a bb pair: here, the b quark enters the hard 
scattering with the second gluon leading to the Z + b final state, and the b typically has 
small transverse momentum and large rapidity. For ALPGEN the default renormalization 
and factorization scales of Q 2 = m z + z are used. 

pythia includes the bb — > Z process as part of the generic qq—^Z process, with a cross 
section related to the parton density of the incoming bb quarks evaluated at the Z mass 
scale. Then, by backwards evolution of the initial-state cascade, two branchings g — > bb 
are constructed to promote the original process to gg — > Zbb, where the scale is set by the 
transverse momentum (in some approximation) of each branching on its own. In addition 
further partons may be emitted in the cascade, and one may also have light quark qq — > Z 
processes where final-state g — > bb branchings gives Zbb topologies. 

For mcfm the CTEQ6M [41] parton distribution functions are used, while for pythia 
and alpgen the CTEQ5L set is used. 

Corrections for the underlying event and the hadronization (see section V) are determined 
using the pythia Monte Carlo and applied to the mcfm prediction. The underlying event 
correction is determined by taking the difference in the cross section with and without the 
underlying event switched on. The hadronization correction is determined by taking the 
difference in the cross section at the parton and hadron level. For a iCt (Z + bjet)/a(Z) 
and the integrated Z + b jet cross section, these corrections result in a net increase of the 
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predicted cross section by 8% with contributions of — 1% from hadronization and +9% from 
the underlying event. For the cross section ratio of Z + b jet to inclusive Z+jet production 
the correction is +4% with contributions of +9% from hadronization and —5% from the 
underlying event. The correction factors for the differential cross section ratios are given 
below. 

The ratio of the integrated Z + b jet cross section for E^ sct > 20 GeV and \r] b ^ ct \ < 1.5 to 
inclusive Z production, for 76 < M u < 106 GeV/c 2 , is measured as 

<7^( Z + b iet") 

1 —r J J = (3.32 ± 0.53(stat.) ± 0.42(syst.)) x 10" 3 . 

a{Z) 

This measurement is proportional to the number of b jets. The NLO QCD prediction of 
mcfm is 2.3 x 10~ 3 for Q 2 = m| + p%z and 2.8 x 10~ 3 for Q 2 = (p^ jet ). The prediction 
of alpgen is 2.1 x 10 -3 and pythia predicts 3.5 x 10~ 3 . The difference between the two 
mcfm predictions shows that there is a rather large theoretical uncertainty for this process. 
The reason for the large difference between alpgen and pythia is primarily due to the use 
of different scales since alpgen uses a large scale while pythia's scale is approximately the 
jet pt- The data are better described by a low choice of scale. 

The ratio of Z + bjet to inclusive Z+jet production is determined as (2.08 ±0.33 ±0.34)% 
compared to predictions of 1.8% (mcfm, Q 2 = m 2 z +p^ z ), 2.2% (mcfm, Q 2 = (p 2 ^)), 1-5% 
(alpgen), and 2.2% (pythia). The Z+frjet cross section is determined to be a ]ct (Z+b]et) = 
0.85±0.14(stat.)±0.12(syst.) pb by multiplying the ratio with the measured inclusive Z cross 
section from CDF of 254.9 ± 16.2 pb [15]. This technique means there is an implicit (small) 
extrapolation from the measurement range 76 < Mu < 106 GeV/c 2 presented in this article 
to the region 66 < Mu < 116 GeV/c 2 , where the inclusive Z cross section measurement was 
made. 

Table III gives the differential results for the ratio of the Z + b jet cross section to the 
inclusive Z production cross section versus the E^ ct and r] biCt . These measurements are 
proportional to the number of b jets. Table IV lists the differential cross section ratios 
versus p% , together with the ratio for one and two jets and for one and two b jets. These 
measurements are proportional to the number of events. Also included in Tables III and IV 
is the correction factor Chad that needs to be applied to parton level calculations to correct 
for the underlying event and hadronization. 

Figs. 4, 5, and 6 show the data compared to the mcfm prediction versus E^ et and rj biet , 
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TABLE III: The ratio of the Z + 6 jet to the inclusive Z cross section versus E T )e (normalized 
per GeV) and rj b ^ et (normalized per unit in pseudorapidity). The statistical uncertainty is listed 
first and the systematic uncertainty is listed second. The correction factor Chad that needs to be 
applied to parton level calculations to correct for the underlying event and hadronization is also 
given. 



E b J ct (GeV) 


a^{Z + b jet)/ 'a(Z) x 10 4 (GeV)" 1 


Chad 


[20, 35] 


1.42 ±0.28 ±0.15 


1.03 


[35, 55] 


0.25 ±0.10 ±0.03 


1.13 


[55, 100] 


0.122 ±0.043 ±0.019 


1.22 


|^&jet| 


ai ct {Z + 6jet)/ff(Z) x 10 3 


Chad 


[0.0,0.5] 


2.44 ±0.57 ±0.28 


1.13 


[0.5,1.0] 


2.90 ±0.65 ±0.39 


1.03 


[1.0,1.5] 


0.79 ±0.50 ±0.14 


1.05 



versus the number of jets and b jets, and versus pf , respectively. The mcfm predictions are 
shown for two different values for the renormalization and factorization scale. 

It is seen that the theoretical cross section prediction depends on the choice of scale, and 
differences up to a factor of two are seen, e.g., in the A/j et distribution. Both predictions 
describe the data but the lower scale choice is favored. 

Figs. 7, 8, and 9 show the data compared to the ALPGEN and pythia Monte Carlo 
programs. Large differences are observed between the two programs, in particular at low 
Ep et and low pf and at low jet multiplicity. We have verified that this difference is reduced 
if we use a lower scale for alpgen but present here only the default used commonly by 
hadron collider experiments. In general pythia describes the data better than alpgen. 
Both MC programs describe the data well at high E^ et and and for jet multiplicities of 
two. 

All predictions are generally in agreement with the data, but differences of up to la 
are observed in the integrated cross section between the data and the mcfm calculation, 
depending on which scale is used. The large spread of the theoretical predictions suggests 
that higher orders in the QCD calculation may be important for this process. 
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TABLE IV: The ratio of the Z + b jet to the inclusive Z cross section versus the number of jets, 
the number of b jets, and the pt of the Z boson for events with at least one 6 jet. In all cases the 
measurement is restricted to E^ et > 20 GeV and \r] b ^ et \ < 1.5. The statistical uncertainty is listed 
first and the systematic uncertainty is listed second. The correction factor Chad that needs to be 
applied to parton level calculations to correct for the underlying event and hadronization is also 
given. 



pf (GeV/c) 


a evtr z+ > ibjeb)/ a (Z) x 10 5 (GeV/cC 1 


Chad 


[0,20] 


4.6 ± 1.4 ±0.7 


1.25 


[20, 35] 


7.0 ±1.9 ±0.8 


1.09 


[35, 55] 


2.9 ±1.0 ±0.2 


0.93 


[55, 100] 


1.11 ±0.39 ±0.15 


1.14 




a evt (Z+ > lbjet)/a(Z) x 10 3 


Chad 


1 


2.23 ±0.42 ±0.28 


1.07 


2 


0.78 ±0.22 ±0.09 


1.12 


iVbjet 


a cvt (Z + N b]et )/a(Z) x 10 3 


Chad 


1 


2.75 ±0.44 ±0.38 


1.07 


2 


0.22 ±0.11 ±0.05 


1.09 



X. CONCLUSION 

In conclusion, we have measured the ratio of Z+bjet production to inclusive Z production 
using the CDF II detector at the Tevatron, and compared to previous measurements the 
uncertainty has been reduced to 20%. For the first time we have presented differential 
measurements as a function of the kinematics of the jets and Z boson and the number of 
jets in the event. These measurements enable the NLO QCD prediction to be tested over 
a wide range of final state observables. Large variations are seen between the theoretical 
predictions as no full NLO QCD calculation is available for this process. The predictions 
generally describe the data, but the agreement is better for those predictions that use a low 
value for the renormalization and factorization scales. 
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